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Abstract. Gessel walks are lattice walks in the quarter plane N 2 which start at the ori- 
gin (0,0) £ N 2 and consist only of steps chosen from the set {«• . /, / . ••»}. We prove that 
if g(n;i,j) denotes the number of Gessel walks of length n which end at the point £ N 2 , 

then the trivariate generating series G(t;x,y) = g(n;i,j)x t yH n is an algebraic function. 



1. Introduction 

The starting question in lattice path theory is the following: How many ways are there to walk 
from the origin through the lattice Z 2 to a specified point £ Z 2 , using a fixed number n of 
steps chosen from a given set S of admissible steps. The question is not hard to answer. If we 
write f(n;i,j) for this number and define the generating function 

oc 

F(t;x,y) := E(E /("5M>V)« n e Q[a;,2/,a; _1 ,y _1 ][[*]] 

n— ij'GZ 

then a simple calculation suffices to see that F(t;x,y) is rational, i.e., it agrees with the series 
expansion at t = of a certain rational function P/Q G Q.(t, x, y). This is elementary and well- 
known. 

Matters are getting more interesting if restrictions are imposed. For example, the generating 
function F(t] x, y) will typically no longer be rational if lattice paths are considered which, as 
before, start at the origin, consist of n steps, end at a given point but which, as an additional 

requirement, never step out of the right half-plane. In was shown in [8, Prop. 2] that no matter 
which set S of admissible steps is chosen, the complete generating function F for such walks is 
algebraic, i.e., it satisfies P(F, t, x, y) = for some polynomial P G Q[T, t, x, y]. 

If the walks are not restricted to a half-plane but to a quarter plane, say to the first quadrant, 
then the generating function F might not even be algebraic. For some step sets it is, for others 
it is not [6, 23]. Among the scries which are not algebraic, there are some which are still D-finitc 
with respect to t (i.e., they satisfy a linear differential equation in t with polynomial coefficients 
in Q[t, x, y\), and others which are not even that [8, 24]. 

Bousquet-Mclou and Mishna [7] have systematically investigated all the walks in the quarter 
plane with step sets S C {<— , \, f, /*, — >, \, J., After discarding trivial cases and applying 
symmetries, they reduced the 256 different step sets to 79 inherently different cases to study. They 
provided a unified way to prove that 22 of those are D-finite, and gave striking evidence that 56 
are not D-finite. Only a single step set sustained their attacks, and this is the step set that we are 
considering here. 

This critical step set is {*— , / , /*, — ►}. The central object of the present article are thus lattice 
walks in Z 2 which 

• start at the origin (0,0), 

• consist of n steps chosen from the step set {<— , / , — >}, and 

• never step out of the first quadrant N 2 of 1? . 
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These walks are also known as Gessel walks. By g(n;i,j), we denote the number of Gessel walks 
of length n which end at the point £ 1? . The complete generating function of this sequence 
is denoted by 

oo 

G(t;x,y) = J2(Yl s(»;*"»J>V)* n . 

n=0 ij£Z 

Since g(n; = if min(i, j) > n or max(i, j) < 0, the inner sum is a polynomial in x and y for 
every fixed choice of n, and thus G(t; x, y) lives in Q[x, y] [[t]]. 

Gessel [unpublished] considered the special end point i = j = 0, i.e., Gessel walks returning to 
the origin, so-called excursions. Their counting sequence g(n; 0,0) starts as 

1, 0, 2, 0, 11, 0, 85, 0, 782, 0, 8004, 0, 88044, 0, 1020162, 0, . . . 

He observed empirically that these numbers admit a simple hypergeometric closed form. His 
observation became known as the Gessel conjecture, and remained open for several years. Only 
recently, it was shown to be true: 



Theorem 1. [16] G(t;0,0) = 3 F 2 



(5/6 1/2 1 
I 5/3 2 



This result obviously implies that G(t; 0, 0) is D-finitc. Less obvious at this point, and actually 
overlooked until now, is the fact that the power series G(i; 0, 0) is even algebraic. Because of the 
alternative representation 



3-^2 



(5/6 1/2 1 
I 5/3 2 



4(H~V /2 



it was clear that algebraicity could be decided by reference to Schwarz's classification [30] of 
algebraic 2F1S, but as nobody recognized that the parameters (—1/6, —1/2; 2/3) actually fit to 
Case III of Schwarz's table, the rumor started to circulate that G(t; 0, 0) is not algebraic. In fact: 

Corollary 2. G(t;0, 0) is algebraic. 

With Theorem 1 and standard software packages like gfun [29, 21] at hand, discovering and 
proving Cor. 2 is an easy computer algebra exercise. Compared to a proof by table-lookup, the 
constructive proof given below has the advantage that it applies similarly also for families of 
functions for which classification results are not available. 

Proof. The idea is to come up with a polynomial P(T,t) in Q[T, t] and prove that P admits the 
power series g(t) = J2^ =0 ^5/3) ^ (2^" (16t)" a s a root. Using Thm. 1, this implies that P(T,t 2 ) is 
an annihilating polynomial for G(i;0,0), so that the latter series is indeed algebraic. 

Such a polynomial P can be guessed starting from the first, say, 100 terms, of the series g(t), 
using for instance Maple's routine seriestoalgeq from the gfun package (see Sections 2.1 and 3.1 for 
more details on automated guessing). The explicit form of P is given below. 

By the implicit function theorem, that polynomial P admits a root r{t) £ Q[[t]] with r(0) = 1. 
Since P(T, 0) = T — 1 has a single root in C, the series r(t) is the unique root of P in C[[t]]. Now, 
r(t) being algebraic, it is D-finite, and thus its coefficients satisfy a recurrence with polynomial 
coefficients. To complete the proof, it is then sufficient to type the following commands into Maple. 

> with (gfun) : 

> P:=(t,T) -> -l+48*t-576*t-2-256*t-3+(l-60*t+912*t~2-512*t~3)*T+(10*t 

-312*t~2+624*t~3-512*t"4)*T"2+(45*t"2-504*t~3-576*t"4)*T"3+(117*t"3 
-252*t~4-288*t~5)*T~4+189*t"4*T"5+189*t~5*T~6+108*t"6*T"7+27*t~7*T~8: 

> gfun:-diff eqtorec(gfun:-algeqtodiff eq(P(t,r) , r(t)), r(t), g(n)); 

This outputs the first-order recurrence 

(n + 2)(3n + 5)g n+1 - 4(6n + 5)(2n + l)g n = 0, g Q = 1, 

satisfied by the coefficients of r(t) = 9nt n - Its solution is g n = ^5/3) ^2^" an< ^ therefore 

g(t) and r(t) coincide, and thus git) is a solution of P, as was to be shown. □ 
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The aim in the present article is to lift the result of Corollary 2 to the complete generating 
function, where x and y are kept as parameters. We are going to show: 

Theorem 3. G(t;x,y) is algebraic. 

This twofold generalization of Thm. 1 is a surprising result. Until now, it was not known whether 
G(t; x, y) is even D-finite with respect to t or not, and both cases seemed equally plausible in view 
of known results about other step sets. Thm. 3 implies that G(t; x, y) is D-finitc with respect 
to each of its variables, and in particular that the sequence g(n;i,j) is P-finitc (i.e., it satisfies 
a linear recurrence with polynomial coefficients in n) for any choice of (i,j) € N 2 . This settles 
several conjectures made by Petkovsek and Wilf in [26, §2]. As noted in [26], even for simple values 
of the sequence g(n;i,j) is not hypergeometric, unlike the excursions sequence g(2n;0,0). 
For instance, the sequence g(2n + 1; 1, 0) satisfies a third order linear recurrence, but it is not 
hypergeometric. Moreover, no closed formula seems to exist for g(n]i,j), for arbitrary All 
this indicates that counting general walks is much more difficult that just counting excursions. 

Theorem 3 will be established by obstinately using the approach based on automatic guessing 
and proof promoted in [5], and by making heavy use of computer algebra. In contrast to Corol- 
lary 2, we manage in our proof of Theorem 3 to avoid exhibiting a polynomial that has G(t; x, y) 
as a root. This is fortunate, since a posteriori estimates show that the minimal polynomial of 
G(t; x, y) is huge, having a total size of about 30Gb. 

Only annihilating polynomials of the section series G(t; x, 0) and G(t; 0, y) are produced and 
manipulated during the computer-driven proof of Theorem 3. But even restricted to those ones, 
our computations have led to expressions far too large to be included into a printed publication; 
too large even to be processed efficiently by standard computer algebra systems like Maple or 
Mathematica. To get the computations completed, it was necessary to use careful implementations 
of sophisticated special purpose algorithms, and to run these on computers equipped with fast 
processors and large memory capacities. These computations were performed using the computer 
algebra system Magma [2]. Our result is therefore interesting not only because of its combinatorial 
significance, but it is also noteworthy because of the immense computational effort that was 
deployed to establish it. 

2. A Dry Run: Kreweras walks 

The computations which were needed for proving Thm. 3 were performed by means of efficient 
special purpose software running on fast hardware. It would not be easy to redo these calculations 
in, say, Maple or Mathematica on a standard computer. As a more easily reproducible calculation, 
we will show in this section how to reprove the classical result that the generating function of 
Kreweras walks is algebraic [19, 14, 6]. A slight variation of the very same reasoning, albeit with 
intermediate expressions far too large to be spelled out here, is then used in the next section to 
establish Thm. 3. 

Kreweras walks differ from Gessel walks only in their choice of admissible steps. They are thus 
defined as lattice walks in 1? which 

• start at the origin (0,0), 

• consist only of steps chosen from the step set {<— , J., /*}, and 

• never step out of the first quadrant N 2 of Z 2 . 

If f(n;i,j) denotes the number of Kreweras walks consisting of n steps and ending at the point 
(i,j) £ Z 2 , then it follows directly from its combinatorial definition that the sequence f(n;i,j) 
satisfies the multivariate recurrence with constant coefficients 

(1) f(n + = f(n;i + 1, j) + f{n;i,j + 1) + f(n;i - 1, j - 1), 

for all n,i,j > 0. Together with the boundary conditions f(n;— 1,0) = /(n;0, — 1) = (n > 0) 
and /(0;i, j) = 5i j t a (i,j > 0), this recurrence equation implies the functional equation 

F(t; x, y) = 1 + (i + i + xy)tF{t; x, y) - HF(t; x, 0) - HF(t; 0, y) 
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for the generating function 

oo oo 

F(t;x,y) =J2(Y1 /(»;<,J>V)* n . 

n— 

Noting that F(t;0,y) and F(t;y,0) are equal by the symmetry of the step set about the main 
diagonal of N 2 , the last equation becomes 

F(t; x, y) = 1 + (i + i + iy)tF(t; a:, y) - ±iF(i; .t, 0) - HF(t; y, 0). 

At the heart of our next arguments is the kernel method, a method commonly attributed to 
Knuth [17, Solutions of Exercises 4 and 11 in §2.2.1] which has already been used to great advantage 
in lattice path counting, see e.g. [12, 27, 6]. After bringing the functional equation for F(t;x,y) 
to the form 

(K) ((x + y + x 2 y 2 )t - xy)F(t; x, y) = xtF(t; x, 0) + ytF(t; y, 0) - xy, 

the kernel method consists of coupling x and y in such a way that this equation reduces to a 
simpler one, from which useful information about the section series F(t;x,0) can be extracted. 
In the present case, the substitution 



x - t - V-4i 2 a: 3 + x 2 - 2tx + t 2 

y -> Y{t, x) = — 

2tx z 

= t + \t 2 + 4^i 3 + ^ii 4 + 2a6 + 6 f 3+1 t 5 + • • • e Q[x, x- 1 }^]}, 

which is legitimate since the power series Y(t, x) has positive valuation, puts the left hand side 
of (K) to zero, and therefore shows that U = F(t; x, 0) is a solution of the reduced kernel equation 

(K red ) U(t, x) = - ^E/(t, Y(t, x)). 

t x 

Now, the key feature of Equation (K re( j) is that its unique solution in Q[[x, t]] is U = F(t; x, 0). 
This is a consequence of the following easy lemma. Here, and in the rest of the article, ord„ S 
denotes the valuation of a power series S with respect to some variable v occurring in S. 

Lemma 4. Let A,B,Ye Q[x, a;~ 1 ][[t]] be such that ordt B > and ordt Y > 0. Then there exists 
at most one power series U G Q[[x, t]] with 

U(t, x) = A(t, x) + B(t, x) ■ U(t, Y(t, x)). 

Proof. By linearity, it suffices to show that the only solution in Q[[x, t]] of the homogeneous 
equation U(t, x) = B(t, x) ■ U(t, Y(t, x)) is the trivial solution [7 = 0. This is a direct consequence 
of the fact that if U were non-zero, then the valuation of B(t,x) ■ U(t,Y(t, x)) would be at least 
equal to ordt B + ordt U, thus strictly greater than the valuation of U(t, x), a contradiction. □ 

We are now ready to reprove the following classical result. 

Theorem 5. [14] F(t;x,y) is algebraic. 

Proof. The strategy is to use a computer-assisted proof, which is completed in two steps: 

(1) Guess an algebraic equation for the series F(t;x,0), by inspection of its initial terms. 

(2) Prove that 

(a) the equation guessed at Step (1) admits exactly one solution in Q[[x, t}], denoted 
F can d(t;x,0); 

(b) the power series U = F can d(t; x, 0) satisfies (K re( j). 

Once this has been accomplished, the fact that U = F(t;x,0) also satisfies Equation (K re d), in 
conjunction with Lemma 4 (with the choice A(t, x) =Y(t,x)/t a.nd B(t,x) = — Y(t, x)/x), implies 
that the power series F can a(t; x, 0) and F(t;x, 0) coincide. 

In particular, F(t;x,0) satisfies the guessed equation, and this certifies that F(t;x,0) is an 
algebraic power series. Since Y(t,x) is algebraic as well, and since the class of algebraic power 
series is closed under addition, multiplication and inversion, it follows from (K) that F(t;x,y) is 
algebraic, too. This concludes the proof. □ 
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In the rest of this section, we supply full details on the automated guessing step (1) and on the 
proving steps (2a) and (2b). 

2.1. Guessing. Given the first few terms of a power series, it is possible to determine potential 
equations that the power series may satisfy, for example by making a suitable ansatz with undeter- 
mined coefficients and solving a linear system. In practice, either Gaussian elimination, or faster, 
special purpose algorithms based on Hcrmite-Pade approximation [1], are used. The computation 
of such candidate equations is known as automated guessing and is one of the most widely known 
features of packages such as Maple's gfun [29]. 

If sufficiently many terms of the series are provided, automated guessing will eventually find an 
equation whenever there is one. The method has two possible drawbacks. First, it may in principle 
return false equations (although, if applied properly, it virtually never does so in practice). This 
is why - in order to provide fully rigorous proofs - equations discovered by this method must be 
subsequently proven by an independent argument. Second, if the precision needed to recover the 
equations is very high, the guessing computations could take extremely long when using traditional 
software. This is typically the case in the Gessel example treated in Section 3, for which dedicated, 
very efficient, algorithms are needed. 

In the Kreweras case, the computations are feasible in Maple. We now provide commented code 
which enables the discovery of an algebraic equation potentially satisfied by F(t;x,0). First, a 
function / is defined which computes the numbers f(n;i,j) via the multivariate recurrence (1). 

> f :=proc(n,i, j) 
option remember; 

if i<0 or j<0 or n<0 then 

elif n=0 then if i=0 and j=0 then 1 else fi 
else f (n-l,i-l, j-l)+f (n-l,i,j+l)+f (n-l,i+l,j) fi 
end : 

Using this function, we compute the first 80 coefficients of F(t; x, 0); they are polynomials in x 
with integer coefficients. The resulting truncated power series is stored in the variable S. 

> prec:=80: 

> S:=series(add(add(f (k,i,0)*x~i,i=0. .k)*t~k,k=0. .prec) ,t,prec-l) : 

Next, starting from S, the gfun guessing function seriestoalgeq discovers a candidate for an al- 
gebraic equation satisfied by F(t; x, 0). For efficiency reasons, we do not use the built-in version of 
gfun, but a recent one which can be downloaded from http : / / algo . inria. fr /libraries/paper s/gfun. html 

> gfun:-seriestoalgeq(S,Fx(t)) : 

> P:=collect(numer(subs(Fx(t)=T,7„[l] )) ,T) ; 

The guessed polynomial reads: 

P(T, t, x) = (16x 3 i 4 + 108i 4 - 72xt 3 + 8x 2 t 2 -2t + x) 

+ (96x 2 t 5 - A&xH 4 - 144i 4 + 104xi 3 - \%x 2 t 2 + 2t- x)T 

+ (48a; 4 t 6 + 192xt 6 - 264x 2 i 5 + 64x 3 i 4 + 32i 4 - 32xt 3 + 9x 2 t 2 )T 2 

+ (I92x 3 t 7 + 128i 7 - 96x 4 t 6 - I92xt 6 + 128x 2 t 5 - 32x 3 t 4 )T 3 

+ (48x 5 i 8 + 192a; 2 t 8 - 192x 3 t 7 + 56x 4 t 6 )T 4 

+ (96x 4 t 9 - A8x 5 t 8 )T 5 + 16x 6 t w T 6 . 

Running Maplel2 on a modern laptop 1 , the whole guessing computation requires about 80Mb 
of memory and takes less than 20 seconds. Once the candidate polynomial P is guessed, one could 
proceed to its empirical certification; this can be done in various ways, as explained in [5]. We do 
not need to do this here, since we are going to prove in §2.2 that F(t; x, 0) is a root of P. 



MacBook Pro; Intel Core 2 Duo Processor, @2.4 GHz; 4Mb cache, 2Gb RAM. 
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One may wonder where the precision 80 used in the previous computations comes from. Here, 
this precision was humanly guessed, being chosen as a reasonable threshold. However, a straight- 
forward doubling technique (not explained here in detail) would allow to automatically tune it, 
by running several times the whole guessing procedure with increasing precision until the same 
polynomial is output two consecutive times. 

2.2. Proving. In this section, we detail the two steps (2a) and (2b) used in the proof of Theorem 5. 

2.2.1. Existence and Uniqueness. Since P(l,0,x) = and ^(1,0, x) = —x, the implicit function 
theorem implies that P admits a unique root F can ^(t; x, 0) in Q((x))[[t]]. It follows that P has at 
most one root in Q[[x,t]] and that this root, if it exists, belongs to Q[x, x _1 ][[i]]. 

Proving the existence of a root of P in Q[[a;, t]] is less straightforward: this time, the equalities 
P(l, 0, 0) = and §f^(l, 0, 0) = prevent us from directly invoking the implicit function theorem. 

We are thus faced to a clumsy technical complication, since what we really need to prove is that 
the root F C and(i; x, 0) actually belongs to Q[[x, t]]: otherwise, the substitution of U = F canc i(i; x, 0) 
in Equation (K re( j), used in Step (2b) of the proof of Thm. 5, would not be legitimate. 

To circumvent this complication, we exploit the fact that, when seen in Q(x)[T, £], the polyno- 
mial P(T, t, x) defines a curve of genus zero over Q(x), which can thus be rationally parameterized. 
Precisely, using Maple's algcurves package, the rational functions Ri(U,x) and R,2(U,x) defined 
by: 

17(1 + U)(l + 2Z7 + U 2 + U 2 x) 2 



Ri(U,x) 



R 2 (U,x) 



h(U, x) 

(U 4 x 2 + 2U 2 (U + l) 2 x + 1 + 4t7 + 6U 2 + 2U 3 - U 4 )h(U, x) 



(1 + [/) 2 (1 + 2J7 + U 2 + [/ 2 x) 4 
with 

h(U, x) = U e x 3 + 3U A (U + lfx 2 + 3U 2 {U + l) 4 x + 1 + 6U + 15U 2 + 24{/ 3 + 27C/ 4 + 18C/ 5 + 5U 6 , 
are found to share the following properties: 

• P(R 2 (U,x),R 1 (U,x),x) = 0; 

• there exists a (unique) power series 

U (t, x) = t + t 2 + (x + l)t 3 + (2x + 5)i 4 + (2x 2 + 3x + 9)t 5 + ... 
in Q[[x, t]] such that R x (U ,x) = t and J7 (0, x) = 0. 
While the first property is easily checked by direct calculation, the second one is a conse- 
quence of the implicit function theorem, since Q{U, t, x) = Ri(U, x) — t satisfies Q(0, 0, 0) = and 
fg(0,0,0) = 1. 

The existence proof of a power series solution of P is then completed using the following 
argument: i?2 having no pole at U = 0, and the valuation of Uq with respect to t being positive, 
the composed power series i?2(C7 (i, x), x) is well defined in Q[[x,i]] and it satisfies 

P(R 2 (U ,x),t,x) =P(R 2 (Uo,x),R 1 (U ,x),x) =0. 

Therefore, F can( j(i; x, 0) = R2(Uo(t,x),x) is the unique power scries solution in Q[[x,<]] of P. 

2.2.2. Compatibility with the reduced kernel equation. We need to show that F can d(i;x,0) so de- 
fined satisfies equation (K re d). This can be done in various ways by resorting to closure properties 
for algebraic power series. These closure properties are performed by means of resultant compu- 
tations, based on Lemma 6 below. 

One possibility is to first prove that the power series S(t, x) £ Q[x, x _1 ] [[t]] defined by 

s(t,x) = 3[M fanjfty(t ) ()) 

t X 

is a root of the polynomial P(T,t,x), and then to use the fact that P has only one root in 
Q[x,x _1 ][[t]], namely F C nnd(t] x, 0). This will imply that S(t,x) and F can( j(i; x, 0) coincide, and 
thus that -F can d(t; x, 0) satisfies equation (K ro( j), as desired. 
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The main point of this approach is that, since the power series Y(t,x) and F clLnc i(t] x , 0) are 
both algebraic, finding a polynomial which annihilates the series S(t, x) can be done in an exact 
manner, without having to appeal to guessing routines. Moreover, the minimal polynomial of 
S(t, x) can be determined by factoring an annihilating polynomial obtained through a resultant 
computation, and, if necessary, by matching the irreducible factors against the initial terms of the 
series S(t, x). 

More precisely, one can use the following classical facts, that we recall for completeness, see 
e.g. [20] for a proof. 

Lemma 6. Let K be a field and let P,Q £ K[T, t, x] be annihilating polynomials of two algebraic 
power series A, B in K[x, x~ ][[£]]. Then 

(1) pA is algebraic for every p £ K(t, x), and it is a root of p de6T p P(T/p, t, x). 

(2) A±B is algebraic, and it is a root oftes z (P(z,t,x),Q(±(T — z),t,x)). 

(3) AB is algebraic, and it is a root of res z (P(z,t,x), z degT< ^Q(T/z,t,x)). 

(4) If ovd x B > 0, then A(t, B(t,x)) is algebraic, and it is a root of res z (P(T,i, z),Q(z,t, x)). 

Since z/t — z/xF cim d(t; z, 0) is a root of (the numerator of) P(x/z(z/t—T), t, z) and since Y(t, x) 
is a root of (x + T + x 2 T 2 )t — xT. Lemma 6 suggests continuing our Maple session by constructing 
a polynomial in Q[T, t, x] which has S(t, x) as a root, in the following way: 

> ker := (T,t,x) -> (x+T+x~2*T~2)*t-x*T: 

> pol := unapply(P,T,t,x) : 

> res := resultant (numer (pol (x/z* (z/t -T) ,t,z)) , ker(z,t,x), z) : 

> factor (primpart (res ,T) ) ; 

The output of the last line is P(T, t, x) 2 , which proves that S(t, x) is a root of P(T, t, x). 

2.3. Consequences. Setting x to in P leads to the conclusion that the generating series 
F(t; 0, 0) of Kreweras excursions is a root of the polynomial 6At 6 T 3 + 16t 3 T 2 + T- 72i 3 T + 54i 3 - 1. 
An argument similar to that used in the proof of Corollary 2 then implies that the coefficients a n 
of F(t; 0, 0) satisfy the linear recursion 

(n + 6)(2n + 9)a n+3 - 54(n + 2)(n + l)a n = 0, a = 1, a x = 0, a 2 = 0, 

which in turn provides an alternative proof of the classical fact [19, 14, 6] that the series F(t; 0, 0) 
is both algebraic and hypergeometric, and it has the following closed form 

A/3 2/3 1 



F(t;0,0) = 3 F 2 



au (3n\ 
\ ^ V n / +3n 



\ 3/2 2 

3. GESSEL WALKS 



27t 6 ) = > -t 

^J(n+l)(2n+l) 



For establishing the proof of Theorem 3, we apply essentially the same reasoning that was 
applied in the previous section for proving Theorem 5. The main difference is that the intermediate 
expressions get very big, so that they can only be handled by special purpose software (see the 
data provided on our website [4]). There are also some additional complications which require to 
vary the arguments slightly. In this section, we point out these complications, describe how to 
circumvent them, and we document our computations. 

The numbers g(n,i,j) of Gessel walks of length n ending at (i,j) £ Z 2 satisfy the recurrence 
equation 

g(n+l;i,j) = g(n;i- 1, j - 1) +g(n;i+ l,j + 1) +g(n;i- l,j)+g(n;i + 

for n,i,j > 0. Together with appropriate boundary conditions, this equation implies that the 
generating function 

oo oo 

G(t; x, y) = ^2 ( X! i) x V 

n— i,j=0 

which we seek to prove algebraic, satisfies the equation 

(K G ) ((1 + y + x 2 y + x 2 y 2 )t - xy)G(t; x, y) = (1 + y)t G(t; 0,y)+t G(t; x, 0) - t G(t; 0, 0) - xy. 
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This is the starting point for the kernel method. 

In this case, because of lack of symmetry with respect to x and y, there are two different ways 
to put the left hand side to zero, using the two substitutions 



y -> Y(t, x) := - (tx 2 -x + t+ ^ {tx 2 - x + t) 2 - At 2 x 2 ) /(2tx 2 ) 

= If -f x2 + 1 t 2 + St+Sx—Lll 3 _|_ g 6 +6a: 4 +63: 2 + 1 ^4 _| 



and x X{t, y) := (y - y/y(y - At 2 {y + l) 2 ))/(2ty(y + 1 )) 

_ V+l t , (j/+l)% 3 , 2(y+l) 5 ,5 , 5fa+l) 7 ,7 , 

v v 2 y 3 y 4 

They yield the equations 

G(i; 0) = a;y(t, a;)/* + G(i; 0, 0) - (1 + Y(t, x))G(t; 0, F(t, x)), 

rod (l+y)G(i;0, 2 /)=X(t,y)y/i + G(i;0,0)-G(t;X(t,y),0) ! 

respectively. Note that the first equation is free of y while the second is free of x. If we rename 
y to x in the second equation, then all quantities belong to Q[x, x" 1 } [[t]]. Note also that we can 
write G(t; x, 0) = G(t; 0, 0) + xU (t, x) and G(t; 0, x) = G(t; 0, 0) + xV(t, x) for certain power series 
U,V £ Q[[x,t]}. In terms of U and V, the two equations above are then equivalent to 

G 2 xU{t, x) = xY(t, x)/t - (1 + Y(t, x))G(t; 0, 0) - Y(t, x)(l + Y(t, x))V{t, Y(t, x)), 

^ Kped ' (1 + x)xV(t, x) = X(t, x)x/t - (1 + x)G(t; 0, 0) - X(t, x)U(t, X(t, x)). 

The two equations (K re ^) correspond to the equation (K re( j) in Section 2. The situation here is 
more complicated in two respects. First, we have two equations and two unknown power series 
U and V rather than a single equation with a single unknown power series F(t; x, 0); this difference 
originates from the lack of symmetry of G(t; x, y) with respect to x and y, which itself comes from 
the asymmetry of the Gessel step set with respect to the main diagonal of N 2 . Second, the two 
equations for U and V still contain G(£;0,0) while there is no term ^(^0,0) present in (K re( j); 
this difference originates from the fact that Gessel's step set contains the admissible step / . 
as opposed to Kreweras's step set. The occurrence of G(t; 0, 0) in the equations (K j?) is not 
really problematic, as we know this power series explicitly thanks to Theorem 1. As for the other 
difference, we need the following variation of Lemma 4. 

Lemma 7. Let Ai,A2,B 1 ,B 2 ,Y 1 ,Y 2 € Q[x, x^ 1 ] [[£]] be such that ord t B x > 0, ord t B 2 > 0, 
ordt Y\ > and ordt Y 2 > 0. Then there exists at most one pair (U\, U 2 ) € Q[[x, i]] 2 with 

Ui(t,x) =A 1 (t,x)+B 1 (t,x) ■ U 2 (t,Yr(t,x)), 
U 2 (t,x) =A 2 (t,x)+B 2 (t,x) ■ C/i(i,F 2 (t,x)). 

Proof. By linearity, it suffices to show that the only solution (U\, U 2 ) in Q[[x, t\] x Q[[x, t]] of the 
homogeneous system 

U!(t,x) =B 1 (t,x)-U 2 (t,Y 1 (t,x)), 
U2(t,x) = B 2 (t,x)-U 1 (t,Y 2 (t,x)) 

is the trivial solution (Ui, U 2 ) = (0, 0). This is a direct consequence of the fact that if both U\ and 
U 2 were non-zero, then the valuation of Bi(t, x) ■ U 2 (t, Y\(t, x)) would be strictly greater than the 
valuation of U 2 (t,x), and the valuation of B 2 (t,x) ■ Ui(t,Y 2 (t,x)) would be strictly greater than 
the valuation of Ui(t,x), thus ordt(£/i) > ordt(U 2 ) > ordt([/i), a contradiction. Therefore, one of 
Ui, U 2 is zero, and the system then implies that both are zero. □ 

By a slightly more careful analysis, the lemma could be refined further such as to show that 
there is only one triple of power series (U,V,G) with U, V £ Q[[x,i]] and G G Q[[i\] (free of x) 
which satisfies (K *?) with G(t; 0, 0) replaced by G. In this version, the proof could be completed 
without reference to the independent proof of Thm. 1 . 

Either way, we can in principle proceed from this point as in Section 2. Out of convenience, we 
choose to regard G(t; 0, 0) as known. Again, we divide the remaining task in two steps: 
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(1) Guess defining algebraic equations for U(t, x) and V(t, x), by inspecting the initial terms 
of G(t; x, 0), resp. of G(t; 0, x). 

(2) Prove that 

(a) each of the guessed equations has a unique solution in Q[[x, t]], denoted U can d(t; x, 0), 
resp. V ca , nd {t;x,0); 

(b) the power series C/ ca nd and T4 an( j indeed satisfy the two equations in (K '?). 

Once this has been accomplished, Lemma 7 implies that the candidate series are actually equal 
to U and V, respectively, and so these series as well as G(t; x, 0) and G(t; 0, y) are in particular 
algebraic. Then equation (K G ) implies that G(t; x, y) is algebraic, too. This then completes the 
proof of Thm. 3. 

3.1. Guessing. In the beginning, we had no reason to suspect that G(t;x,y) is algebraic, since 
even the specialization G(t; 0, 0) was generally thought to be transcendental. 

Motivated by the case x = y = (i.e., by Thm. 1, which was merely a conjecture by that 
time), we wanted to find out whether G(t;x,y) has chances to be D-finite with respect to t, and 
searched for linear differential equations with polynomial coefficients potentially satisfied by its 
sections G(t; x, 0) and G(t; 0, y). With such equations at hand, we could have, in principle, proven 
the D-finitencss of G(t; x, y), by very much the same reasoning that we apply here for proving that 
G(t; x, y) is algebraic. 

We realized quickly that the differential equations for G(t; x, 0) and G{t; 0, y), if they exist, are 
too big to be caught by the guessers implemented in packages like Maple's gfun or Mathematical 
GeneratingFunctions. In order to gain efficiency, we switched to Magma, which provides efficient 
implementations of low- level algorithms, and we opted for applying a modular approach: we set x 
and y to special values xo, yo = 1, 2, 3, . . . , and in addition, we kept numerical coefficients reduced 
modulo several fixed primes p to avoid the emerging of large rational numbers. Modulo a prime p, 
and starting from the first 1000 terms of the series G(t; x, 0) and G(t; 0, y), we used a very efficient 
automated guessing scheme, relying on the Bcckcrmann-Labahn (FFT-based) super-fast algorithm 
for computing Hermite-Pade approximants [1]. Eventually, we made the following observations: 

• For any choice of p and xq, there are several differential operators in Z p [t](D t ), of order 14 
and with coefficients of degree at most 43, which seem to annihilate G(t; xq, 0) in Z p [[t]]. 

• For any choice of p and yo, there are several differential operators in 7i p [i\(D t ), of order 15 
and with coefficients of degree at most 34, which seem to annihilate G(t; 0, yo) in Z p [[£]]. 

(Here, and hereafter, D t stands for the usual derivation operator and for any ring R, we denote 
by R[t](Dt) the Weyl algebra of differential operators with polynomial coefficients in t over R.) 

The next idea was to apply an interpolation mechanism in order to reconstruct, starting from 
guesses for various choices of xq and yo, and modulo various primes p, two candidate operators: 
one in Q[x,t](D t ) that would annihilate G(t;x,0) in Q[a:][[t]], and the other one in Q[y,t](D t ), 
that would annihilate G(t;0,y) in Q[y][[i]]. The ingredients needed to put such an interpolation 
scheme into practice are rational function interpolation and rational number reconstruction. Both 
are standard techniques in computer algebra, for details on fast algorithms we refer to [13]. 

To our surprise, when applied to the order 14 and 15 differential operators mentioned above, 
we found this reconstruction scheme to require an unreasonably large number of evaluation points 
xo,yo = 1,2,3,..., suggesting an unreasonably high degree of the operators with respect to x or y, 
respectively. We aborted the computation when the expected degree exceeded 1500 (!). At this 
point, we had the impression that the section series G(t; x, 0) and G(t; 0, y) might not be D-finite. 

Our next attempt was to find candidate operators of smaller total size by trading order against 
degree. We went back to the series G(i;xo,0) modulo p, and tried to determine the least order 
operator JC^ G Z p [t](D t ) annihilating it. This was done by taking several candidate operators 
in Z p [t](D t } of order 14 as above, and by computing their greatest common right divisor (gcrd) 
in the rational Weyl algebra Z p (t)(D t ). (Despite the non-commutativity of Z p (t){D t ), this can be 
done by a variant of the Euclidean algorithm [25, 9], see [15] for a more efficient grcd algorithm.) 
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We applied the same strategy to find the least order operator Cq Vo £ Z p [t](D t ) annihilating the 
series G(t; 0, yo) modulo p. 

Doing so for several evaluation points Xq, yo = 1, 2, 3, . . . and several choices of p, it was finally 
possible, by using the interpolation scheme described above, to reconstruct from the various mod- 
ular candidate operators C^ and C p ^ ya , two candidates C x .o S Q[x,t](D t ) and Co, y G Q[y,t](D t ) 
with reasonable degrees in x and y. 

The operators L Xy o and Lo, y are posted on our website [4]. The operator C X: o has order 11, 
degree 96 with respect to t and degree only 78 (!) in x. Its longest integer coefficients has only 61 
decimal digits. The operator Co, y is even nicer. Its order is 11, its degree is 68 with respect to t 
and just 28 (!) with respect to y. Its longest integer coefficient has 51 decimal digits. 

The whole procedure for guessing C x $ and Co, y took less than 2 CPU hours on a modern 
computer running Magma v 2.13 (12). To achieve this speed, we greatly benefited, on the one 
side, from the fast Magma's buit-in polynomial, integer and modular arithmetic, and on the 
other side, from our own efficient implementations of several algorithms (e.g., for Hermite-Pade 
approximation, for rational function interpolation and for grcds). 

For instance, to compute C x $ we used 21 primes p of 28 bits each and 158 distinct integer 
values of xq. Modulo each prime p, 150 CPU seconds were enough to compute: 158 bundles of 
four differential operators in 1 p [t](D t ) with order 14 and with coefficients of degree at most 43 
(by Hermite-Pade approximation), 158 operators in Z p [t](D t ) of order 11 and with coefficients of 
degree at most 96 (by right grcd) and 12 x 97 = 1164 rational functions in Z p (x) with numerators 
and denominators of degree at most 78 (by rational interpolation). At this point, the operator 
contained 97 x 79 x 12 = 91956 terms of the form a tjfk t z x j D^ (i < 96, j < 78, k < 11), where each 
c i,j,k € Q was known modulo the 21 primes. The constants Cij,k were recovered by performing 
91956 rational number reconstructions. The whole computation of C x $ took 55 minutes. Guessing 
Co, y using the same method was even a little bit faster. 

The exceptionally small sizes of C x $ and Co, y (in comparison to the intermediate expressions) 
speak very much in favor of their correctness. Also, the fact that the operators C x $ and Co, y 
verify the following equalities in Q[[t]]: 

£ x ,o(G(t; x, 0)) mod t lmo = and C , y (G(t; 0, y)) mod t 1000 = 0, 

provides more empirical evidence that C x $ and Cq^ v are indeed annihilating operators for G(t] x, 0) 
and G(t;0,y), respectively. 

There are a number of additional tests which can be performed to experimentally sustain the 
evidence that a guessed differential operator is correct (see our paper [5] for a collection of such 
tests), and our operators C x ^o and Co. y successfully pass all these tests. 

One of the tests consists of checking whether the operators C Xt o and Co.y possess an arithmetic 
property which is expected from the minimal order operator annihilating a generating function 
like G(t;x,0) and G(t;0,y), see [5]. This property, called global nilpotency [11], can be stated as 
follows: for almost any prime number p, the order 11 operators C Xt o, resp. Co.y, should right- 
divide the pure power Dl l p in Z p (x,t)(D t ), resp. in Z p (y,t)(D t ). We checked that this property 
indeed holds for all primes p < 100. We actually found out that the operators C x .o and £o,y have a 
stronger property: they even right-divide D P ; in other terms, they have zero p-curvature for all the 
tested primes p. This was the key observation which led us to suspect that G(t\ x, y) is algebraic, 
for according to a famous conjecture of Grothcndicck [28], an operator has zero p-curvature if and 
only if it admits a basis of algebraic solutions. The conjecture is still open (even for second order 
operators), but it is generally believed to be true. In either case, something interesting is going on: 
either G(t; x, y) is algebraic or we have found operators which very much look like counterexamples 
to Grothendieck's conjecture. 

We next searched for potential polynomial equations satisfied by the power series U, V £ Q[[x, t]} 
defined by G(t; x, 0) = G(t; 0, 0) + xll(t, x) and G(t; 0, x) = G(t; 0, 0) + xV(t, x). We did not find 
any using only 1000 terms of those series, but we found some starting from 1200 terms. Using 
again guessing techniques based on fast modular Hermite-Pade approximation, combined with 
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an interpolation scheme, we discovered two polynomials P\(T, t, x) £ Q[T,t,x] and P2(T,t,y) £ 
Q[T,t,y] which satisfy 

Pi(U(t,x),t,x) = Omodi 1200 and P 2 (V(t, y), t, y) = mod t 1200 . 

These polynomials arc posted on our website [4]. The polynomial Pi has degrees 24, 44, and 32 
with respect to T, t, and x, respectively, and involves integers with no more than 21 decimal 
digits. The polynomial P^ has degrees 24, 46, and 56 with respect to T, t, and y, respectively, and 
involves integers with no more than 27 decimal digits. Spelled out explicitly in this article, they 
would both together fill about thirty pages; they are however much smaller than the differential 
operators C x ,o and £o. y , for which five hundred pages would not be enough! Just like C x $ and 
£o,yj the polynomials P\ and P2 pass a number of heuristic tests which let them appear plausible. 

We are now going to prove that the guessed polynomials P± and P2 are indeed valid. 

3.2. Proving. Let Pi £ Q[T,t,x] and P2 £ Q[T,t,y] be the two polynomials posted on the 
website to this article [4] . We show (i) that these polynomials admit unique power series solutions 
U ca ,nd(t, x) and V can d(t,x), respectively, and (ii) that these power series satisfy the reduced kernel 
equations (K^' d 2 ). 

3.2.1. Existence and Uniqueness. As in the case of Kreweras's walks, the implicit function theorem 
docs not apply to these polynomials, but unlike in the Kreweras case, an existence proof using a 
suitable rational parameterization is not possible either, because the polynomials at hand define 
curves of positive genus, and therefore a rational parameterization does not exist. 
In order to obtain a proof in this situation, we proceeded along the following lines: 

• First we used Theorem 3.6 of McDonald [22] to obtain the existence of a series solution 

^ ^ Cp^qt^ X^ 

with c VA = for all (p, q) outside a certain halfplanc H C Q 2 . 

• Next, we computed a system of bivariate recurrence equations with polynomial coefficients 
that the coefficients c p q must necessarily satisfy. This can be done in principle by soft- 
ware packages such as Chyzak's mgfun [10] or Koutschan's HolonomicFunctions .m [18]. 
However, for reasons of efficiency we used our own implementation of the respective algo- 
rithms. 

• The form of the recurrences together with the shape of the halfplanc H imply that the 
coefficients c Pi(j of any solution can be nonzero only in a finite union of cones v + Nu + Nw 
with vertices v £ Q 2 and basis vectors u, w £ Q 2 that can be computed explicitly. If 
Cp,q 7^ for some index (p, q) in such a cone, then also the coefficient at the cone's vertex 
must be nonzero. 

• Applying McDonald's generalization of Puiseux's algorithm, we determined the first co- 
efficients of series solutions to an accuracy that all further coefficients belong to some 
translate of H which contains no vertices. 

• As one of these partial solutions contained no terms with fractional powers, it was possible 
to conclude that the entire series contains no terms with fractional exponents. Reference to 
u and w implied that this partial solution could also not contain any terms with negative 
integral exponents, so the only remaining possibility was that the solution is in fact a 
power series. 

A full description of the argument requires a somewhat lengthy discussion of a number of 
technical details, which we prefer to avoid here. A supplement to this article is provided on our 
website [4] in which we carry out existence proofs in full detail that both Pi and P2 admit some 
power series solutions ?7 C and and V^ an d, respectively. 
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3.2.2. Compatibility with the reduced kernel equation. It remains to show that these solutions U C!md 
and V cand satisfy the system (K re ^). Because of X(t, Y(t, x)) = x, the substitution x — ► Y(t,x) 
transforms the second equation of that system to the first. Therefore, it suffices to prove the 
second equation: 

(2) (l + x)xV cand (t;x,0) = X(t,x)x/t-{l + x)G(t;0,0)-X(t,x)U cand {t;X(t,x),0). 

Letting Gi(t,x) = G{t; 0, 0) + xll cand (t; x, 0) and G 2 (t, x) = G(t; 0, 0) + xV cand {t; x, 0), the last 
equation is equivalent to 

(3) (l + x)G 2 (t,x) - G(t;0,0) = xX(t,x)/t- G^X^x)). 
By Corollary 2 and Lemma 6, the power series 

{l + x)G 2 (t, x) -G(i;0,0) and xX(t, x)/t - Gi(t, X(t, x)) 

are algebraic and we can compute their minimal polynomials — at least in theory. Now the poly- 
nomials P\ and P 2 are so big that the required resultant computations cannot be carried out by 
Maple or Mathematica. 

There are efficient special purpose algorithms available for the particular kind of resultants 
at hand [3] and our Magma implementation of these algorithms is able to perform the necessary 
computations. It turns out that the minimal polynomials for both power scries arc identical. It 
is provided electronically on the website to this article. After determining a suitable number of 
initial terms of both series and observing that they match, it can be concluded that Equations (3) 
and (2) hold. This completes the proof of Theorem 3. 

3.3. Consequences. The fact that G(t;x,y) is algebraic has consequences which are of combi- 
natorial interest. We list some. 

Corollary 8. The following series are algebraic: 

• G(i; 1, 1) - the generating function of Gessel walks with arbitrary endpoint. 

• G(i; 1,0) and G(i;0, 1) - the generating functions of Gessel walks ending somewhere on 
the x-axis or the y-axis, respectively. 

Using the built-in equation solver of Mathematica, we found that all these series, as well as the 
series G(t;0,0), can be expressed in terms of nested radicals, for example 



G(t;l,l) = — -3 + V3a 



16t(2t + 3) + 2 TJU . 2 



ar jTv y wr | Hp) 



where U(t) = \j 1 + 4{/t(4t + 1)7(4* - l) 4 . 

The radical representations of the other series are much more involved than this one. They are 
available electronically at the website to this article [4]. Also their minimal polynomials can be 
found there. 

Corollary 9. For every point the series Gij(t) := ~Yl^ = Qg{n]i, j)t n is algebraic. 

Proof. We have 



1 / d l d? , 

G «® = m(*?w G{tiX ' y) 



x=y=0 



and the property of being algebraic is preserved under differentiation and evaluation. □ 

The previous corollary implies in particular that the conjecture of Petkovsek and Wilf [26] that 
g(n; 0, j) (for fixed j) and g(n; 1, 0) are P-fmite is right, and that their conjecture that g(n; 2, 0) is 
not P-fmite is wrong. 
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For the degrees of the minimal polynomials pi_j(T,t) of Gjj(i), we observed empirically that 

^ „.._/ 4 ifi = 2j + l 



but we are not able to prove these degree formulas in general. 

Note that our proof of Theorem 3 does not provide us with the minimal polynomial of G(t; x, y). 
This polynomial will, in fact, be much larger than the minimal polynomials of the series Gij(t) 
or the series obtained form G(t;x,y) by setting x, y or t to special values. From the sizes of 
the minimal polynomials of G(t;x, 0) and G(t;0,y), which we know explicitly, it can be deduced 
that the minimal polynomial p(T, t, x, y) of G(t;x,y) will have degrees 72, 141, 263, and 287 with 
respect to T, t, x, and y, respectively, and thus consist of more than 750 Mio terms. 

Corollary 10. G(t;x,y) is D-finite with respect to any of the variables x,y and t. 

As every algebraic power series is D-finite, this is an immediate consequence of Theorem 3, even 
if we regard G(t; x, y) as a multivariate power series in t, x, and y rather than as a power series in 
t only with x and y belonging to the coefficient domain. 

D-finitcness in t only amounts to the existence of a linear differential equation in d/ alt with 
coefficients in Q(x, y)[t]. This can be proven independently in as similar way as Theorem 3. It 
suffices to discover differential operators which potentially annihilate U{t,x) and V(t,y), respec- 
tively, define J7 can( j(t, x) and V can d(t,y) as the unique power series annihilated by these operators 
and matching the first terms of U (t, x) and V(t, y), respectively, and prove that these series satisfy 
the equations in (K^ d ). 

The option of proving D-finiteness (with respect to t) directly is important when other step 
sets instead of Gessel's {<—, — /, /} are considered, for which the generating function G(t; x, y) 
is D-finite in t but not algebraic. As shown by Mishna [23], such step sets do exist. 

Explicit knowledge of differential operators annihilating G(t; x, 0) and G(t; 0, y) also allows to 
deduce bounds on the size of the differential operator annihilating G(t; x, y). According to a priori 
estimations, this operators will have order up to 22 and polynomial coefficients with degrees up 
to 1968, 936, and 336 with respect to t, x, and y, respectively, and thus consist of about 1.4 • 10 10 



Corollary 11. For fixed i and j, the number g(n;i,j) can be computed with 0{n) arithmetic 
operations. 

For fixed x andy, the coefficient (t n )G(t; x, y) can be computed with 0{n) arithmetic operations. 

Proof. By Cor. 10, the coefficient sequence g(n;i,j) is P-finite with respect to n. Therefore, it 
satisfies a uniform recurrence with respect to n. This recurrence, together with appropriate initial 
values, allows the computation of g{n;io,jo) in linear time. 

The argument for the second assertion is similar. □ 
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